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Rhabdomyosarcoma (RMS) is the most commonly occurring type of soft tissue tumor in children. However, 
it is rare in adults, and therefore, very little is known about the most appropriate treatment strategy for adult 
RMS patients. We performed genomic analysis of RMS cells derived from a 27-year-old male patient whose 
disease was refractory to treatment. A peritoneal seeding nodule from the primary tumor, pleural 
metastases, malignant pleural effusion, and ascites obtained during disease progression, were analyzed. 
Whole exome sequencing revealed 23 candidate variants, and 10 of 23 mutations were validated by Sanger 
sequencing. Three of 10 mutations were present in both primary and metastatic tumors, and 3 mutations 
were detected only in metastatic specimens. Comparative genomic hybridization array analysis revealed 
prominent amplification in the 12ql3-14 region, and more specifically, the CDK4 proto-oncogene was 
highly amplified. ALK overexpression was observed at both protein and RNA levels. However, an ALK 
fusion assay using NanoString technology failed to show any ALK rearrangements. Little genetic 
heterogeneity was observed between primary and metastatic RMS cells. We propose that CDK4, located at 
12ql4, is a potential target for drug development for RMS treatment. 



Rhabdomyosarcoma (RMS) is the most commonly occurring type of soft tissue tumor in children', but it is 
less common in adults, accounting for only 2-5% of all soft tissue sarcomas^. Given the rarity of this disease, 
very little information is available on the most appropriate treatment strategy for adult RMS patients. 
Patients with unresectable or metastatic RMS have an extremely low cure rate and a poor prognosis'''*. A 
substantial improvement in survival has been achieved with the introduction of intensive chemotherapy regi- 
mens, which are usually based on pediatric oncology clinical trials on RMS'" However, survival rates for patients 
with metastatic disease remain disappointing, and the prognosis is dismal in patients with a poor response to 
salvage chemotherapy'"''". Thus, identification of novel therapeutic targets in RMS is urgently needed in order to 
improve treatment outcomes for this aggressive type of tumor. 

Major histologic subtypes of RMS include embryonal RMS (ERMS) and alveolar RMS (ARMS)". Despite 
advances in therapy, patients with the ARMS histological variant of RMS have a 5-year survival of less than 30%. 
ARMS presents with distinctive chromosomal translocations that result in specific fusion gene products, the most 
prevalent of which are PAX3-F0X01 (55%) and PAX7-FOX01 (22%)'^. Reciprocal translocation of chromo- 
somes 2 and 13 results in a PAX3-PKHR fusion gene in ARMS, which fuses the region of the gene encoding the 
DNA-binding domain of the transcription factor PAX3 with that encoding the transactivation domain of the 
transcription factor FKHR in-frame (3, 4). However, at least 25% of ARMS cases lack such translocations, 
suggesting that ARMS is not a single disease, but a heterogeneous group of conditions with a common phenotype. 
Moreover, studies on the gene expression profile of RMS have proposed new molecular classifications''' and have 
revealed that a specific gene expression signature potentially determines tumor behavior as well as treatment 
outcome'''"'. ALK is one of the targets of interest, given that ALK alterations are relatively common in RMS, 
although the function of its gene product remains unknown'^. 



SCIENTIFIC REPORTS | 4:3623 | DOI: 1 0. 1 038/srep03623 



1 



Here, we report the clinical application of genomic profiling in 
identifying potential novel genetic mutations in patients with 
relapsed and chemotherapy-refractory alveolar RMS. 

Results 

Case presentation. A 27-year-old man presented with a complaint of 
left upper quadrant abdominal pain that had lasted for 3 years. 
Computed tomography (CT) and positron emission tomography 
(PET) scans showed multiple malignant masses, involving the 
pancreas and left upper abdominal wall, and pleural seeding was 
also noted (Fig. lA). Pathological examination of the abdominal 
wall mass showed thin fibrous septae lined by small round blue 
cells in an alveolar growth or solid pattern; the cells appeared to 
lack cohesion and had hyperchromatic nuclei and scant cytoplasm. 
The tumor cells were diffusely positive for CD99, desmin, and WTl 
and showed scattered focal positivity for cytokeratin. Ki-67 staining 
revealed high proliferative activity of the tumor cells. The FKHR 
break-apart fluorescence in situ hybridization (FISH) showed 



separate green and red signals, confirming FKHR rearrangement 
(Supplementary Fig. SI). 

Based on the histology, immunohistochemistry (IHC), and FISH 
results, ARMS was diagnosed, and alternating cycles of vincristine, 
doxorubicin, and cyclophosphamide (VDC) and ifosfamide and eto- 
poside (IE) were administered every 3 weeks. 

After completing a 1-year course of cytotoxic chemotherapy, the 
patient achieved near complete remission, with disappearance of the 
multiple masses and pleural seeding (Fig. IB). On the basis of a tumor 
board discussion involving a multi-modality team for sarcoma, the 
residual peritoneal seeding nodules were surgically resected. At the 
time of surgery, the resected seeding nodules were snap frozen and 
immediately stored at — 80°C for molecular analysis. The pathologic 
examination of the resected peritoneal seeding nodules verified the 
diagnosis of ARMS. 

Postoperative follow-up abdominal pelvis CT and chest CT 
demonstrated no evidence of malignancy. However, 3 months after 
surgical resection, the patient was found to have developed a chest 




Yellow: abdominal wall mass 
Green: pancreas mass 



B 




Figure 1 | Computed tomography (CT) findings during the course of the disease. (A). Abdominal wall mass (yellow) and pancreas mass (green) at the 
time of initial diagnosis. (B). After l"-line chemotherapy, the disease virtually disappeared. (C). A massive amount of left pleural effusion was seen 
after salvage chemotherapy. 
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wall mass of approximately 2 cm in size. Salvage etoposide, ifosfa- 
mide, and cisplatin (VIP) chemotherapy was administered and ini- 
tially elicited a partially positive response. However, soon afterward, 
the patient developed rapidly progressive disease with a massive 
amount of left pleural effusion after the 5th cycle of VIP (Fig. IC). 
Because dyspnea was caused by rapidly increasing pleural effusion, 
talc pleurodesis and pleural biopsy were performed through video- 
assisted thoracoscopic surgery (VATS). At this time, the patient 
agreed to fuU genetic testing using the tissue specimens to identify 
potential molecular targets (sample #2). 

In addition, malignant cells from the pleural effusion were cul- 
tured and stored at — 80^C for molecular analysis and in vitro drug 
sensitivity tests (sample #3). However, the patient developed malig- 
nant ascites (sample #4) immediately after pleurodesis, and 4 cycles 
of paclitaxel and ifosfamide were administered. The patient 
developed cord compression with paraplegia, experienced continued 
disease progression, and died several weeks later. 

Genomic profiling and somatic mutation. DNA from the primary 
tumor (sample #1) was sequenced, revealing 23 candidate variants: 
SCNIB, PPP1R3A, GRID2, APBA2, ZNF142, ZYGllA, RBFOXl, 
TCF7L1, NARF, KIAA0182, TEX13B, MUC2, LRRC3, GRHL3, 
MUC16, TTR, UBAl, FENl, ELAC2, NBFALl, DSCAMLl, PCD- 
HA4, and P0LR3C (Table 1). Only the MUC16 mutation was de- 
tected in blood, with 3.4% allele frequency. Amino acid substitutions 
were predicted to arise from some of the point mutations in each 
gene, and these in turn were predicted to have a substantial pheno- 
typic effect based on the SIFT score (a SIFT score of 0 indicates a 
deleterious effect, a score £ 0.05 indicates a damaging effect, and a 
score > 0.05 suggests that the substitution can be tolerated). These 
protein mutations were also predicted to have considerable func- 
tional impact based on the FI score as determined by Mutation 
Assessor (http://mutationassessor.org) (an FI score £ 0.8 is 
considered neutral; 0.8 < FI score £ 1.9 indicates low impact; 1.9 



< FI score £ 3.5 indicates medium impact; and FI score > 3.5 
indicates high impact). Among the variants detected in exome 
sequencing from the tumor specimen, those in SCNIB, PPP1R3A, 
GRID2, APBA2, ZNF142, ZYGllA, RBFOXl, TCF7L1, TEX13B, and 
DSCAMLl were validated by Sanger sequencing, and the details of 
these 10 candidate genes are provided in Table 2. 

The primary tumor (sample #1), pleural metastases (sample #2), 
and malignant cells from ascites (sample #4) were all found to carry 
point mutations in SCNIB, PPP1R3A, and ZYGllA. However, 
APBA2, ZNF142, and RBFOXl mutations, although not present in 
the primary tumor, were detected in both metastatic (chemotherapy 
refractory) specimens. TCF7L1, TEX13B, and DSCAMLi mutations, 
which were detected during exome sequencing, were not confirmed 
in subsequent Sanger sequencing of the primary tumor or any meta- 
static specimens. Although the mutation in GRID2 was not seen in 
the primary tumor due to failure of the sequencing reactions, the 
mutation was confirmed in both metastatic samples. 

Comparative genomic hybridization array analysis of the primary 
tumor. Although several chromosomal regions showed evidence of 
copy number variations (CNVs; Supplementary Table SI), the 
12ql3.3-ql4.1 region demonstrated the highest level of chromoso- 
mal amplification (Fig. 2A). As this region contains multiple genes, 
we analyzed the amplification of each gene (Table 3). Within this 
region, the CDK4 proto-oncogene was highly amplified, and several 
other genes in the amplicon also showed various degrees of amplifi- 
cation, including NACA, HSD17B6, SDR9C7, RDH16, GPR182, 
ZBTB39, TAC3, MYOIA, NAB2, STAT6, and LRPl. The overexpres- 
sion of CDK4 at the protein level was also confirmed by IHC 
(Fig. 2B). 

ALK fusion assay. As ALK overexpression has been reported in re- 
fractory RMS'"'", we assessed the level of ALK protein expression 
using IHC. As shown in Fig. 3A-1, ALK IHC was strongly positive in 



Table 1 | Somatic variants detected in rhabdomyosarcoma (i?MS; primary tumor, sample #1 ) by whole exome sequencing 



No 


Ch 


position 


gene name 


ref 


var 


vac 


tr 


vrf 


vac 


tr 


vrf 


Impact 


fi score 


prediction 


score 


1 


19 


35524413 


SCNIB 


A 


G 


0 


35 


0.0% 


1 1 


21 


52.4% 


low 


1.865 


D 


0 


2 


7 


11 35 19206 


PPP1R3A 


A 


T 


0 


102 


0.0% 


66 


142 


46.5% 


neutral 


0 


D 


0.03 


3 


4 


9441 1875 


GRID2 


C 


A 


0 


61 


0.0% 


31 


67 


46.3% 






D 


0 


4 


15 


29346353 


APBA2 


G 


A 


0 


62 


0.0% 


23 


53 


43.4% 


medium 


2.28 


D 


0.02 


5 


2 
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ZNF142 


C 


T 
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9 


21 


42.9% 


low 
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T 


0.3 


6 


1 
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ZYGl lA 
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18 


43 
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D 
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8 
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TCF7L1 
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T 
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9 


17 
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NARF 


A 


G 


0 
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8 


28 


28.6% 
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0.455 


D 


0.03 


10 


16 
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KIAA0182 


T 


A 


0 


24 


0.0% 


4 


16 


25.0% 
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1.32 


D 


0.01 


1 1 


X 


107224952 


TEX13B 


G 


A 


0 


47 


0.0% 


12 


54 


22.2% 


low 


1.5 


T 


0.06 


12 


1 1 


1093375 


MUC2 


C 


T 


0 


26 


0.0% 


4 


18 


22.2% 






T 


0.23 


13 


21 


45877015 


LRRC3 


A 


C 


0 


28 


0.0% 


4 


20 


20.0% 


low 


0.865 


T 


0.1 1 


14 


1 


24663626 


GRHL3 


A 


C 


0 


26 


0.0% 


4 


22 


18.2% 


low 


1.67 


D 


0 


15 


19 
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MUC16 


A 


C 


2 


58 


3.4% 


8 


44 


18.2% 
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1.935 


T 


0.13 


16 


18 
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TTR 


G 
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D 
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0 
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3.34 


D 


0 
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1 1 
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FENl 
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G 


0 


29 


0.0% 


5 


32 


15.6% 
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2.83 


D 


0 


19 


17 
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ELAC2 


A 


C 


0 


26 


0.0% 


4 


26 


15.4% 
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1.87 


D 


0 


20 
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204045 1 8 1 
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C 


0 
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0.0% 
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4.49 


D 


0 


21 


1 1 
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DSCAMLl 


A 


C 


0 


37 


0.0% 


5 


37 


13.5% 


medium 


2.93 


D 


0 


22 


5 


140188268 


PCDHA4 


T 


G 


0 


39 


0.0% 


5 


37 


13.5% 


medium 


2.905 


D 


0 


23 


1 


145601821 


POLR3C 


G 


C 


0 


39 


0.0% 


5 


37 


13.5% 


low 


1.39 


D 


0.04 



Ch, chromosome; ref, reference; var, variant; vac, variant allele count; tr, total read; vrf, variant read frequency; D, damaging; T, tolerated; SIFT, Sorting Tolerant From Intolerant (Nucleic Acids Res 
2003;31 :3812). 

Nos. 1 , 2, 3, 4, 5, 6, 7, 8, 1 1 , and 21 : somatic mutations that were validated by Sanger sequencing (Table 2). 
No. not mentioned above: somatic mutations that were not validated by Sanger sequencing. 
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most of the tumor cells. Given the high level of ALK protein 
expression, ALK RNA overexpression was also detected in the 
pleural metastasis specimen (sample #2), as expected (Fig. 3B). 

Two lung cancer cell lines, NCIH3122 and NCIH2228, were used 
as positive controls for EML4-ALK fusion and A549 cells were used 
as the negative control. ALK-fusion lung cancer only overexpresses 
the 3'-AIKmRNA (NCIH3122 and NCIH2228), whereas sarcoma 
overexpresses the full-length mRNA. The mean of 3' -ALK express- 
ion of ALK* lung tumor is approximately 500, whereas this reached 
approximately 3000 in the sarcoma. Despite ALK overexpression at 
the protein and RNA levels, ALK amplification was not observed in 
the tumor cells of this patient. 

Next, we screened for the presence of ALK fusion partners using a 
NanoString assay; however, the ALK fusion assay failed to show any 
ALK rearrangements (Fig. 3C). ALK rearrangement was also not 
detected by FISH (Fig. 3A-3). 

Discussion 

Patients with recurrent RMS usually present with a rapidly deteri- 
orating condition and have markedly limited options in terms of 
chemotherapy'^'^". In this study, we found that the majority of somatic 
mutations found during exome sequencing of primary tumor tissue 
were also observed in metastatic tumor tissue and metastatic cells in 
ascites samples, although Sanger sequencing revealed genetic altera- 
tions involving several genes, such as APBA2, ZNP1142, and 
RBFOXl, only in the metastatic samples. These results led to 2 
important conclusions: First, there is little genetic heterogeneity 
between primary and metastatic RMS cells at least in terms of muta- 
tional spectra, reflecting relatively little genetic evolution during the 
course of metastasis. This is consistent with the results of recent 
similar studies on melanoma"", breast^'', and pancreatic cancers^', 
in which genomic profiling for both primary and metastatic sites 
was performed. Second, we confirmed that malignant cells isolated 
from body fluid can be used for genomic profiling, as their genome is 
nearly identical to that of the resected tumor specimen. This may be 
especially important in clinical practice because body fluid can be 
obtained relatively easily using a bedside procedure. Of the 23 can- 
didate genes found during exome sequencing, we selected 9 mutated 
genes {APBA2, RBFOXl, TCF7L1, MUC16, UBAl, FENl, NBEALl, 
DSCAMLl, and PCDHA4) for which substantial functional impact 
was predicted (medium or high functional impact; FI score > 1.9), 
with or without damaging/deleterious phenotypic effects based on 
the SIFT score (so.05), and assessed their clinical relevance to RMS. 
However, we could not find any pre-existing evidence that these 
genetic alterations contribute to RMS development. 

The aCGH array used in this study revealed prominent amplifica- 
tion in the 12ql3 and 12ql4 regions. Although we found that many 
genes within this chromosome 12 region were amplified, CDK4 amp- 
lification was of particular interest because it is known to play a 
pivotal role in the oncogenic process^**, and perhaps more impor- 
tantly, the corresponding proteins are potential drug targets^^. Its 
overexpression is frequently observed in well-differentiated and ded- 
ifferentiated liposarcomas^*""^"; consequently, a clinical trial of the 
CDK4 inhibitor (PD0332991) for CDK4-amplified tumors has been 
conducted. In both phase I and phase II trials, the CDK4 inhibitor 
has proven effective in C, and a randomized phase 3 trial is being 
considered by researchers^'. Amplification of 12ql3-ql4 and CDK4 
in RMS has been reported previously'"'^', as has amplification of 
MYCN, and both of these genes are known to be involved in RMS 
tumorigenesis'^. However, these genes are associated with distinct 
expression profiles and clinical parameters. MYCN overexpression 
occurs more frequently in cases in which 2p24 amplification is pre- 
sent, whereas CDK4 overexpression is associated 'with 12ql3-14 
amplification^'. In addition, 12ql3-14 amplification was signifi- 
cantly associated with poor clinical outcomes, such as short 
failure-free and overall survival, compared to that seen in cases with 
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Figure 2 | Array comparative genomic hybridization (aCGH) of the primary tvmior and CDK4 overexpression. (A) . aCGH of the primary tumor: the 
12ql3.3-ql4.1 region demonstrated the highest level of chromosomal amplification. (B-1). CDK4 immunohistochemistry of the primary tumor 
confirmed overexpression of CDK4 protein; CDK4 is located within the 12ql3.3-ql4.1 region. (B-2). Negative control for CDK4 immunohistochemistry. 



2q24 amplification". In the RMS case studied here, we confirmed the 
presence of a 12ql3-14 amplification and showed that CDK4 is one 
of the genes overexpressed in this chromosomal region. 

A study provided in vitro evidence for the successful pharmaco- 
logic inhibition of CDK4/CDK6 activity in myoblasts and RMS- 
derived cells^*; in this study, most ARMS-and ERMS-derived cell 
lines and tumor samples expressed CDK4 and CDK6, and exposure 
of these cells to a CDK4 inhibitor caused Gl cell cycle arrest, which is 
closely associated with myogenic differentiation. Given that defective 
cell cycle control, which leads to failure of myogenic differentiation, 
is one of the notable characteristics of RMS-derived cells, it was not 
surprising that CDK4 inhibition with PD0332991 ultimately facili- 
tated skeletal muscle differentiation. This finding suggests that 
CDK4 inhibition is a potential therapeutic strategy for RMS. 
However, there is a scarcity of data on the use of a CDK4 inhibitor 
in patients with RMS. Although it is therefore difficult to draw firm 
conclusions regarding the potential efficacy of this inhibitor, the need 
for novel therapeutics arising from the dismal prognosis in refractory 
RMS, together with the genetic profiling data presented here, warrant 
clinical trials on a CDK4 inhibitor in chemotherapy-refractory RMS 
patients. 

In agreement with previous reports'"", we found that ALK was 
overexpressed in the RMS tumor in the current case. Although ALK 
overexpression is frequently detected in RMS, the mechanisms 
underlying this phenomenon are yet to be defined. However, a 
high-affinity binding site for the PAX3 and FOXOl transcription 
factors in the intron of ALK has been reported to mediate high levels 
of ALK transcription''^, and increases in ALK copy number have also 



been described" '", although this did not always correlate with ele- 
vated ALK protein expression. A recent extensive cohort study on 
ALK aberration in RMS" revealed that approximately 90% of ARMS 
patients and 50% of ERMS patients exhibited ALK copy number 
gains, whereas only 4% of RMS patients showed true amplification 
of ALK. In our study, ALK amplification was not observed, although 
ALK was overexpressed. The results of our study indicate that the 
overexpression of wild-type ALK alone may not be sufficient to drive 
tumor growth and that ALK may therefore not be an effective drug 
target in RMS. Currently, clinical trial NCT # 01121588 (clinical- 
trials.gov) on crizotinib therapy for ALK-positive solid tumor types is 
ongoing. Since our study is limited to one case only, further studies 
are required to elucidate the antitumor efficacy of crizotinib and the 
CDK4 inhibitor in sarcomas in the context of clinical trials. 

In summary, our study revealed that there was little genetic het- 
erogeneity between primary and metastatic RMS cells and suggested 
that malignant cells from body fluid can be used for genomic profil- 
ing of RMS patients. The RMS tumor in this case overexpressed ALK, 
but this was not associated with the amplification or translocation of 
this gene. Prominent amplification of the 12ql3-14 region was also 
observed, and we propose that CDK4, located in 12ql4, is a potential 
target for drugs in RMS. 

Methods 

Ethics statement. This study was approved by the SMC Institutional Review Board 
and was conducted in accordance with the 1996 Declaration of Helsinki. Written 
informed consent was obtained from the patient before genomic analyses were 
performed for research purposes. 



SCIENTIFIC REPORTS | 4:3623 | DOI: 1 0. 1 038/srep03623 



5 



Table 3 | Amplification o 


genes within 


the 1 2ql 3.3-14.1 region 








CytoBand 


Start 


Stop 


Genes 


Description 




Log ratio 


Amplification 


a 


3.3 


O/ 1 1 O/ 1 u 


O/ 1 1 J/ Oo 




nascent polypeptide-associated connplex alpha sl 


burnt 


o nR71 AQ.A'^ 
o.UO/ 1 4o40 


1 0799 1 90 

1 .y/ zz 1 zy 


a 


3.3 


"171 AT97A 

O/ 1 CJOZ/ o 


C"7 1 ATT? '^ 
O/ 1 OOooO 


l-Kni 7RA 
1 # DO 


hydroxysteroid (17-beta] dehydroQenase 6 homo 




9 y1RTylR9A 
Z.40o4oz0 


1 0799 1 90 

1 .y/ zz 1 zy 


n 
M 


3.3 




O/ oL\j/ 04 




short chain dehydroQenase/reductase family 9C, 
member 7 


A 09/10099^ 
4.Uz4yyzzO 


Q c cyp 1 1 c 
0 .00/ olio 


n 
M 


3.3 




^7T /I A7/1 1 


RDM 1 A 


retinol dehydrogenase 1 6 




o.Ozozoy4 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


O/ O OO J J 7 


^7T HQA 1 A 
0/ OOOO 1 0 


APR 1 ft9 


G protein-coupled receptor 1 82 




A MARCTTOA 
4.UOoOoyO 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


'i7TQ?r)71 

O/ OtOKJ/ I 


cyTOT 1 Tn 

o/ jyo 1 ou 


7RTR10 


zinc finger and BTB domain containinc| 39 




A RARriOR7 

4.ooouyo/ 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


^7/ln71 TO 
o/ ^u/ 1 oy 


-Ifiy 1 09 

1 yz 




tachykinin 3 




T m OTROR 

o.u 1 yooUo 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


'i74?4049 


A T '^noo 


in 1 ^/ 1 H 


myosin lA 




T ar'=;r'^'?t 

o.OoOoOoo 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


•J/ ^ooo/ / 


0/ 4oO/ o 1 




NGFI-A binding protein 2 (EGRl binding protein 


9* 


T AR7R A AQf^ 
o.Oo/ O44o0 


Q c cyp 1 1 c 
0 .00/ olio 


n 


3.3 


^7/10ym9 
o/ ^y^ooz 


C"7y10y1 TOO 




signal transducer and activator of transcription 6, 


T 0/19 /I RR 1 
o.U4z4oO 1 


Q c cyp 1 1 c 
0 .00/ olio 












interleukin-4 induced 










3.3 




cycoA'^ y| 
j/ J J0j4j 


1 PD1 


low density lipoprotein receptor-related protein 1 




0.4OO0O0OO 


Q c cyp 1 1 c 
0 .00/ olio 


a 


3.3 


'i7A97R'?9 


'^7A97RR 1 
0/ Oz/ Oo 1 


on/v\ 1 z 


serine hydroxymethyltransferase 2 




1 1 0 1 /I OA 
1 . 1 y 1 4UO 


1 0799 1 90 
1 .y/ zz 1 zy 


a 


3.3 


O/ OO 1 ouu 


'^7A'5 1 c;a 

0/ Oo 1 JOO 


Kini IFAyll 9 

INL'UrM^LZ 


NADH dehydrogenase 




1 '3AR'5n91 
1 .oOooUz 1 


1 0799 1 90 

1 .y/ zz 1 zV 


a 


3.3 


'i7A4n'iRS 


cyA/inAT A 


O 1 r\\^0 


SH3 and cysteine rich domain 3 




0 90RA9n7A 

U-zyoOzu/ 0 


0 '5An779A 
U.oOU/ / zO 


q 


3.3 


^7A/)R/lni 
O/ O^O^U 1 


cyA /I Q /I c:o 


i\onu/v\z 


R3H domain containing 2 




rv 1 OTO 1 TO 

u. 1 zjy 1 jy 


n '3An779A 
U.oOU/ / zO 


q 


3.3 


^7PTA/ln9 


cypoA/l A 1 
O/ 0o040 1 




inhibin, beta C 




0 1 A97071 
U. 1 OZ/ U/ 1 


0 '5A0779A 
U.oOU/ / zO 




3.3 


57851519 


cyp c 1 cyp 
0/ OO 1 D/ O 


INHBE 


inhibin, beta E 




U. 1 4VzUzOV 


n '5An779A 
U.oOU/ / zO 


a 


3.3 


57856767 


cyp t^AQ.OA 
D/ oOOOzO 


GUI 


GLI family zinc finger 1 




1 MM9 1 '^07 

1 .UUz 1 jy/ 


M '3An779A 
U.oOU/ / zO 




3.3 


O/ OOO 1 O 1 


57866240 




Rho GTPase activating protein 9 




0.52336573 


0.3607726 


a 


3.3 


•J/ OOZ^/ 1 


57882530 


MARS 


methionyl-tRNA synthetase 




0.62351316 


0.3607726 


a 


3.3 


o/ y 1 1 1 oo 


cyo 1 1 9'3 /I 
0/ V 1 1 Z o4 


\J\J\ \ o 


DNA-damage-inducible transcript 3 




rv 1 r\A 1 7*^9 
U. 1 UO 1 / Oz 


n '3An779A 
U.oOU/ / zO 


q 


3.3 


'=;70T? TOT 


cyOTT /I ^^9 


L/v- 1 INZ 


dynactin 2 [p50) 




0 RRATOAR 
U.OOOoUOO 


0 '5An779A 
U.oOU/ / zO 


q 


3.3 


c;7Q>iQOnA 
•J/ y^oyuo 


0/ V4oyOj 




kinesin family member 5A 




r\ AC\A AOQ.O A 
U.4U44yoz4 


n '5An779A 
U.oOU/ / zO 


n 


3.3 


57988882 


'^yORCOTO 
O/Vooy oV 


PIP4K2C 


phosphatidylinositol-5-phosphate 4-kinase, type II 




n 9T /inARn9 

U.zo4vJOoUz 


n '3A0779A 
U.OOU// ZO 


n 
M 


3.3 




58000436 


L/ 1 AO 


gamma 

deltex homolog 3 (Drosophila) 




0.07017814 


0.3607726 


a 


3.3 




58009302 


wcr 1 


RhoA/RAC/CDC42 exchange factor 




0.29716232 


0.3607726 


a 


3.3 


i^Qni PROS 

vJOU 1 ooy J 


con 1 RO/1 9 
OoU 1 oy4z 


Qi r9AA 1 0 


solute carrier family 26, member 1 0 




0 ROT5 1 
U.oyoo4o 1 0 


0 '5An779A 
U.oOU/ / zO 


q 


3.3 


JOUZOO 1 O 


c pno coyo 
OoUz Jo/ z 


Ryir;AI MTl 


beta-1 ,4-N-acetyl-galactosaminyl transferase 1 




U.O4OU0U 1 0 


M '3An779A 
U.oOU/ / zO 


q 


4.1 


SR 1 1 0909 


CR 1 1 n9/l7 
Oo 1 1 UZ4/ 


woy 


osteosarcoma amplified 9, endoplasmic reticulum 


lectin 


9 RR7'^A 1 R 

z.oo/ 00 1 0 


T 9700000 

0 .z/ uuyyy 


q 


4.1 


581 18971 


c p 1 1 onoA 
Oo 1 1 VUzO 


AGAP2 


ArfGAP v^ith GTPase domain, ankyrin repeat and PH 

-J ;„ O 


T oncoo 1 A 
o.zUOzy 1 0 


T 9700000 

0 .z/uuyyy 


q 


4.1 


00 1 JyVoO 


58140017 




uuniuin z 

tetraspanin 3 1 




2.3439372 


3.2700999 


q 


4.1 


58143212 


58143260 


CDK4 


cyclin-dependent kinase 4 




3.8085285 


3.2700999 


q 


4.1 


58163728 


581 63784 


METTLl 


methyltransferase like 1 




3.1 829998 


3.2700999 


q 


4.1 


58167136 


58167187 


FAMl 19B 


family with sequence similarity 1 1 9, member B 




3.5654607 


3.2700999 


q 


4.1 


58186836 


58186894 


TSFM 


Ts translation elongation factor, mitochondrial 




3.2734551 


3.2700999 


q 


4.1 


58205712 


58205771 


AVIL 


advillin 




3.21 19691 


3.2700999 


Bold: genes 


fhat were detected also 


n exome copy num 


Der variation (CNV) 











IHC. Five-micrometer-thick tissue sections were deparaffinized in xylene, 
reliydrated, and heated to 100 C in citrate buffer (pH 6.0) for 5 min for non- 
enzymatic antigen retrieval. The sections were incubated with monoclonal mouse 
anti-human desmin antibodies {1 : 100 dilution; RLM30; Novocastra, Newcastle- 
upon-Tyne, UK) for 60 min at room temperature, followed by incubation with a 
1 : 1000 dilution of biotinylated goat anti-mouse IgG {Vector Laboratories, 
Burlingame, CA, USA) for 1 h at room temperature. The sections were stained with 
diaminobenzidine chromogen for 5-10 min and were then counterstained with 
hematoxylin for 5 min. 

FISH. FISH was performed using commercially available ALK (Vysis LSI ALK Dual 
Color, Break Apart Rearrangement Probe; Abbott Molecular, Abbott Park, IL) and 
FKHR (Vysis LSI FKHR Dual Color, Break Apart Rearrangement Probe; Abbott 
Molecular) probes according to the manufacturer's instructions. One hundred cells 
were analyzed in each case. FISH was considered positive when more than 15% of the 
tumor cells showed distinct red and green signals and/or a single red (residual 3') 
signal; alternatively, the specimen was classified as FISH negative. 

Biospecimen processing and quality control. Excised tumor tissues were divided 
into 2 pieces. One piece was embedded in optimal cutting temperature compound 
and used to prepare hematoxylin and eosin-stained frozen section slides. The other 
pieces of tissue were snap frozen in liquid nitrogen and stored at — 80' C. The tumor 
cell populations on the frozen section slide accounted for more than 60% of the total 
cell population; less than 10% were necrotic. Genomic DNA was extracted from snap 
frozen tissue and peripheral blood using the QIAmp DNA Mini kit (Qiagen GmbH, 
Hilden Germany) according to the manufacturer's instructions. DNA integrity was 
evaluated using 1% agarose gel electrophoresis. Tumor and normal DNA 
concentrations were measured using PicoGreen dsDNA Quantitation Reagent 



(Invitrogen, Carlsbad, CA). A minimum DNA concentration of 20 ng/|il was 
required for aCGH. 

Exome sequencing and analysis. Genomic DNA was extracted from the blood and 
primary tumor (abdomen) of the RMS patient. Exon capture was performed using 
AgUent SureSelectXT Human All Exon (50 M), which includes all exons annotated in 
the consensus CDS (CCDS) database, as well as 10 bp of flanking sequence for each 
targeted region (http://www.genomics.agilent.com). The captured DNA fragments 
were sequenced with Illumine Hiseq2000, generating 100 bp X 2 paired-end reads. 
The clean reads were aligned against the human reference genome (hgl9/GRCh37) 
using the Burrows -Wheeler Aligner (BWA). The alignment results were further 
processed sequentially using local realignment, duplicate read marking, and base 
quality recalibration by using the Picard (http://picard.sourceforge.net) and GATK 
(http://www.broadinstitute.org/gatk/) pipeline software. Variant and germline 
calling were performed using JointSNVMtx (http://code.google.com/p/joint-snv- 
mtx/), and the somatic mutations observed in tumor tissue were annotated using 
ANNOVAR (http://www.openbioinformatics.org/annovar/). 

Quality control, sequence alignment, somatic variant calling, and annotation. In 

the first quality control step, Cutadapt v. 1.0 [1] removed adapter sequences from the 
input fastq sequence. After adapter trimming, Fastx v.0.0.13 [2] filtered low-quality 
reads, such that base quality was more than 20 and the proportion of good-quality 
bases in each read was more than 50%. Finally, cmpFastq [3] classified paired-end 
reads and single-end reads. Classified fastq sequences were aligned to the human 
reference sequence (hgl9) using the Burrow- Wheeler Aligner v.0.5.9 (BWA) [4], and 
were then merged to a BAM file. Subsequently, sequential cleanup processes, 
consisting of the addition or replacement of read groups, marking and removing 
duplicates, and fixing mate information were performed using Picard Tools v. 1 .69 [5] . 
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Present case 



NCIH3122: EML4-ALK positive (VI) 
NCIH2228: EML4-ALK positive (V3a/b) 
A549: EML4-4LK negative 



Figure 3 | ALK expression and fusion assay. (A) . ALK immunohistochemistry (2A- 1 , positive in the present case, 2A-2, negative control) and ALK FISH 
(2A-3) of the primary tumor: ALK protein overexpression was confirmed, but ALK rearrangement was not detected. (B). AliCRNA expression in tumor 
cells from pleural metastasis (sample#2): ALiC RNA overexpression was detected. The AL.K' expression has been normalized to that of 4 housekeeping 
genes. Lung cancer cell lines (NCIH3122, NCIH2228, and A549) were used as controls for AIX' RNA expression and £MI-ALiC fusion detection. 
(C). ALK fusion assay using NanoString: no EML4-ALK RNA was detected in the sarcoma specimen. 



The cleaned bam file was then sorted using Samtools v.OT.18 [6] and the local 
realignment and base quality score recalibration were processed using the Genome 
Analysis Toolkit v. 1.6-7 (GATK) [7]. 

Somatic mutations were designated into 3 categories: single nucleotide poly- 
morphisms (SNPs), indels, and CNVs. We began by applying the join t_snv_mix_one 
model in JointSNVMix v.0.7.5 [8] in order to find point mutations, and used Annovar 
[9] , Mutation Assessor [10], and SIFT [11] for annotation. Annovar performed filter- 
based annotation indicating mutations that are present in 1000 genome projects or 
dbSNP {snpl35). It also performed gene-based annotation using Mutation Assessor 
and SIFT to identify whether protein- coding changes caused by SNPs or CNVs are 
deleterious. We selected genes that were annotated as "medium or high functional 



impact" by Mutation Assessor and were predicted as "damaging" by SIFT. Indels 
were detected by the SomaticIndelDetector in GATK, following which Annovar gene- 
based annotation was used to describe the functional impact of somatic indels. CNVs 
were detected using ExomeCNV {R package) from the coverage file prepared using 
DepthOfCoverage in GATK. We used default parameters, except for the aforemen- 
tioned software. 

aCGH. Genomic DNA was extracted from the cells cultured from the 
primary tumor of the patient. aCGH was performed using the Agilent Human 
Genome CGH Microarray Kit 8 X 60 K, which contains approximately 45,000 
probes. 
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ALK fusion transcript assay. nCounter assays were performed in duplicate, 
according to the manufacturer's instructions (NanoString Technologies, Inc, Seattle, 
WA, USA). Briefly, 500 ng of total RNA was hybridized to nCounter probe sets for 
16 h at 65"C. Samples were processed using an automated nCounter Sample Prep 
Station (NanoString Technologies, Inc). Cartridges containing immobilized and 
aligned reporter complexes were subsequently imaged on the nCounter Digital 
Analyzer (NanoString Technologies, Inc), set at 1 155 fields of view. Reporter counts 
were collected using the nSolver analysis software version 1 in NanoString, 
normalized, and analyzed as described below. A detailed description of the assay is 
given elsewhere^^. 
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